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1 Introduction 

The Shatters relation and the VC dimension have been investigated since the early sev- 
enties (starting with IT7l[T8l[T9l '). These concepts have found numerous applications 
in statistics, combinatorics, learning theory and computational geometry. 

Shattering extremal systems are set-systems with a very rich structure and many dif- 
ferent characterizations. The goal of this paper is to elaborate on the structure of these 
systems. They were discovered several times and independently by several groups of 
researchers. Lawrence in lfl"2l is the first who introduced them in his study of con- 
vex sets. Intrestingly, the definition he gave does not require the concept of shatters. 
Independently, Bollobas et al in (5] have discovered these systems, using the shatters 
relation (a.k.a traces); furthermore, Bollobas et al are the first who introduced the rela- 
tion of strongly- shatters (a.k.a strongly-traces) and characterized Shattering extremal 
systems with the shatters and strongly-shatters relations. Dress et al, independently of 
Bollobas et al, have discovered the same characterization and established the equiva- 
lence to the characterization given by Lawrence. 

It is an interesting phenomena that, in the past thirty years, different characteriza- 
tions of these systems were discovered in different fields of pure and applied mathe- 
matics. Strangely, there seem to be no connections between the different groups that 
studied these systems. We hope to link together these different groups by this work 
and as a result to enhance the research of these systems. Here is a list of some of the 
contexts in which these systems were discovered: Functional analysis ff3l . Discrete- 
geometry ( lfl2lD . Phylogenetic Combinatorics (J5J and Extremal Combinatorics 
(JUS)- Moreover, as this work shows, the class of Shattering extremal systems nat- 
urally extends the class of Maximum system^. Maximum systems occur in Geometry 
(HI and in Learning theory l6l ITTl [161 [141 \l5l . Thus, it is not unlikely to find usage of 
Shattering extremal systems in these fields. 

1.1 Our results 

We present new definitions of shatters and strongly-shatters that differ from each other 
only in the order of the quantifiers. We demonstrate that at least some of the known 
duality between these two concepts is due to this transpose of quantifiers. We shed ad- 
ditional light on this mysterious phenomena of duality via a mechanical transformation 
that, sometimes, translates claims to dual claims and proofs to dual proofs. 

Two unary operators, which we denote (+) and l+J, were extensively used under many 
different notations in the literature. These operators preserve the property of shattering 
extremality and the property of being a maximum system. Hence, these operarators 
were studied in both contexts. Many previous works observed a certain duality between 
these two operators. We demonstrate that this duality stems from the famous duality 
of Boolean algebra. Moreover, we reveal that a previously known lemma concerning 
duality is just the De-Morgan laws. 

We present the famous down shifting operators in the terminology of oblivious 
sorting algorithms. The most common use of these operators is to apply a sequence 
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of them on a system until a fixed point is reached. Our presentation of down-shifting 
sheds light on this process and relates it to a basic problem in the theory of sorting. 

We introduce new characterizations of the shatters and strongly shatters relations. 
One characterization is by the operators (+) and l+J and the other characterization is 
by down-shifting operators. As a corollary we obtain simple proofs to two known 
characterizations of Shattering extremal systems via commutativity between [+) and l+J 
and commutativity between down-shifting operators. Moreover, we introduce a weaker 
form of Shattering-Extremality and establish equivalences of this form with a weaker 
form of the above commutativities. 

We show that, similarly to Maximum systems, Shattering extremal systems occur 
naturally in Geometry as a combinatorial feature of an arrangement of oriented hyper- 
planes. We give a geometrical interpretation of shattering and strong-shattering in 
this setup. We also introduce the class of "Convex systems" which form a simple and 
natural generalization of the classes studied by Welzl (|8j) and of the classes studied 
by Lawrence (O). We pose as an open question whether every Shattering Extremal 
system is a convex system.. 

Finally, we use this theory to develop a machinery for proving certain equalities 
and inequalities concerning undirected graphs and their orientations. As an example of 
this machinery we will prove the following inequality: 

Let G = {V,E) be an undirected graph and let s,t be two distinct vertices in V. Then, 
the number of orientations of E that contain a directed path from s to t is the same as 
the number of subgraph^ of G that contain an undirected s — t path. 
One advantage of this machinery is that it demonstrates an application of Strong shat- 
tering which is exceptional since most applications of this theory use shattering. 
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2 Shattering Extremal systems - A first date 

2.1 Basic definitions 

This work studies finite set-systems and thus all structures discussed in this paper are 
finite. A standard definition of a set-system is a pair (T,X) where X is a set and T C 
!P(X). For reasons that will become clear later, we choose a different definition for set- 
systems. Also, we prefer to call these structures "systems" rather than "set-systems". 

Definition 2.1 (System). A system is a pair, 8 = (S, {0, 1} X ), where X is a set and 
SC{0,1 } x . For a system § = (S, {0, 1 } x ): 

• Let S(S), dim(S) and C(S) denote S, X and {0, 1 } x respectively. 

• Let |S| denote \S\. 

• The complement o/S, denoted S, is the system defined by: 

S '= {C{S)-S(S),C(S)). 

• The elements o/C(S) are called vertices 

Note that for every set X there are exactly 2 2 '*' systems, S, with C(S) = {0, 1 } x . 
In particular, there are exactly two systems for X = 0, one of them is empty and the 
other contains one vertex. Let % denote the former and let "Kj denote the latter system. 

Definition 2.2. Let f,g be arbitrary functions and let A C Dom(f) C\Dom(g). 

• / agrees with g on A means that f(x) = g(x) for all x G A. 

• / agrees with g means that f agrees with g on Dom(f) C\Dom{g) 

Definition 2.3 (Cube). Let X be a set and let Y C X. A F-cube o/{0, 1} Z is an equiv- 
alence class of the following equivalence relation on {0,1}^: "u agrees with v on 
X-Y" 

• Let C be a Y-cube of {0, 1 } x . The dimensions of C are defined to be the set Y 

and denoted by dim(C). 

Note that there are 2l x ~ F l F-cubes, they are disjoint and cover all of {0, 1} X . 

2.2 Lopsided systems 

Lawrence introduced the concept of lopsided systems as follows 

Definition 2.4 (Lawrence IT2| ). A system S is called lopsided if for every {X',X"}, a 
partitioning o/dim(S), exactly one of the following statements is true: 

1. S(S) contains an X 1 -cube 

2. S{S) contains a X" -cube 

The following important property of lopsided set-sytems is an immediate corollary 
of the definition: 

Theorem 2.5 (Complementing). A system, S, is lopsided if and only if^E> is lopsided. 
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2.3 Shatters and Strongly-Shatters 

Definition 2.6 (Merging). Let f e {0, 1 } x and g e {0, 1 } Y where X,Y are disjoint. We 
define f -kg to be the unique function in {0, 1 } XUY that agrees with both f and g. 

Note that * is a commutative and associative operation. 
Let § be a system, let X = dim(S) and let Y C dim(S). 

Definition 2.7 (Shatters). We say that S shatters Y if 

(V/e{0,if)(3ge{0,i}*- 7 ):g*/eS(§) 
Definition 2.8 (Strong-shatters). We say that § strongly shatters Y if 

(3ge{o,i} x - r We{o,i} T ):g*fes 

That is, our definitions of strongly shatters and shatters are identical except of the 
order of the two quantifiers. These definitions highlight the duality that was observed 
by previous works. For example, a straightforward application of predicate calculus on 
the above definitions gives the next lemma: 

Lemma 2.9 (0Q]). Let S be a system and let {X',X"}, be a partitioning of dim(S). 
Then exactly one of the following statements is true: 

1. § shatters X' 

2. strongly shatters X" 

Note the similarity between Lemma [2791 and definition 12.41 That is, the former is 
derived from the latter by replacing "shatters" in Statement 1 with "strongly-shatters". 
A natural variant of "Lopsided" is the following concept: 

Definition 2.10 (Dual lopsided system). A system § is called Dual lopsided if for every 
{X',X"}, a partitioning o/dim(§), exactly one of the following statements is true: 

1. § shatters X' 

2. shatters X" 

Two subsets of !P(dim(§)) are associated with S: 

clef 

• str(S) = {Y C dim(S) | Y is shattered by §} 

clef 

• sstr(S) = {Y C dim(S) | Y is strongly shattered by §} 

Obviously, both str(S) and sstr(S) are closed under subset relation and sstr(S) C str(S). 
This work studies systems for which sstr(S) = str(S). 
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2.4 Shattering extremal systems 

Definition 2.11 (Shattering-Extremal system). § is Shattering-Extrema0 ( in abbre- 
viation: SE) if it satisfies 

sstr(S) =str(S). 

The property "§ is SE" is sometimes refered to by the expression "SE(S) ". 

2.5 Shattering extremality = lopsidedness 

This section proves the following theorem: 

Theorem 2.12. The following statements are equivalent for a system §: 

1. SE(S) 

2. SE(S) 

3. 8 is lopsided 

4. § is dual lopsided 

The equivalence of the first three statements was proven by Dress et al in 0. 
Lemma 2.13. Let § be a system. S is lopsided <^=> sstr(§) = str(§) 
Proof. 

Lopsided(S) VX g fP(dim(S)) : dim(S) -X <£ sstr(^§) «Xe sstr(S) by definition 
<^=> VX e fP(dim(S)) : X e str(S) Me sstr(S) by LemmaEH 

<^> sstr(S) = str(S) . 

□ 

As noted earlier, there is a certain duality in this theory between shattering and 
strong shattering. This duality is manifested by a certain mechanical tranformation on 
text written in "mathematical english". The texts of interest are lemmas and proofs and 
we refer to the transformed texts as dual lemmas and dual proofs. This dual transfor- 
mation swaps few words and symbols as follows: It swaps the pair "str" and "sstr", the 
pair "Lopsided" and "Dual lopsided", the pair "C" and "3" and the pair "<" and ">". 
This dual transformation is useful since, sometimes, the dual of a true lemma is true. 
Moreover, sometimes the dual of a proof is a valid proof of the dual lemma. Lemma 
I2.13l and its proof are an example to this phenomenon. 

Lemma 2.14 (Dual of Lemma l2.13b . Let § be a system. Then 

S is dual lopsided ■<=>• sstr(S) = str(S). 

It is easy to verify that the dual of the proof of Lemma 12.131 is a valid proof of 
Lemma l2.14l 

These two lemmas establish equivalence between items 1,3,4 in Theorem 12. 121 
The rest of the theorem is derived from Theorem l2.5l 

3 This is not the standard definition of Shattering Extremal systems. The standard definition and its 
equivalence to this definition will be discussed in Section fO] 
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3 Inequalities 

This chapter presents basic inequalities which involve the concepts of Shattering and 
Strong-Shattering and discusses their relation with SE systems. 



3.1 VC-dimension and Sauer's lemma 

Definition 3.1 (VC-dimension). The VC-dimension (Vapnik Chervonenkis dimension) 
of a system S, denoted vc(S), is the cardinality of the largest subset that is shattered 
by it. 

The well-known "Sauer lemma" was proved in the 70's by Sauer ( ifTTl ). Shelah 
(QH) and Vapnik and Chervonenkis (lfl9l): 

Theorem 3.2 (Sauer's lemma). Let § be a system with | dim(S) | = n. Then 




A proof for this lemma will be given in the next section. 

3.2 The Sandwich theorem 

The following generalization of Sauer's lemma is the result of an accumulated work 
by several authors. Different parts of this theorem were proven independently several 
times. (Pajor lfl3l . Bollobas and Radcliffe Q, Dress 15] and Holzman and Aharoni 

IB 13) 

Theorem 3.3 (Sandwich theorem [13]|3]|5]E|9)). For a system §: 

|sstr(S)| < |S| < str(S) 

This section discusses and proves this theorem. The following trivial fact will be 
useful. For any a',b',a",b" e M: 

a'<b'md a" <b" and a' + b' =a" + b" =>• a = b' and a" = b" (1) 



4 As a special case, vc(S) = — 1 when S(S) = 
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Lemma 3.4. Let S, §',§" be systems that satisfy the following: 

1. |S'| + |S"| = |S| 

2. |str(S')| + |str(S")| < |str(S)| 

3. |§'| < |str(S')|, |S"| < |str(S")| 
Then: 

(a) |S| < |str(S)| 

(b) |S| = |str(S) | Assumptions 2 and 3 ;« f/ze premise of the lemma are satisfied 
with equality 

Remark: Assumption 3 is always true as will be proved later 
Proof. Regarding Conclusion (a): 

|str(S)| > |str(S')l + |str(S")| by Assumption 2 

>|S'| + |S"| by Assumption 3 

= |S| by Assumption 1 

Considering Conclusion (b): The right to left direction is trivial. Assume |S| = |str(S)|. 
Therefore, the above chain of inequalities holds when the symbol '>' is replaced with 
'='. This implies equality in Assumption 2. Fact (Q]i implies equality in Assumption 
3. □ 

Definition 3.5. Let § be a system, let x € dim(S). The restrictions of S associated 

with x are the following two systems: 

• ({/r«Hm(S)-W = /GS,/W-0},{0,l} dim < s )-W) 

• ({/WO-M = /GS,/W = l},{0,l} dim ( § )-W) 

Let § be a system, let x € dim(§) and let {§',§"} be the pair of restrictions of S 
associated with x. We will prove that |S| < |str(S)| by showing that these particular 
§, §', S" satisfy the premise of Lemma [3~4l 

Definition 3.6. Let A,B be sets and x be an object. Let (B x denote the following oper- 
ator: 

A® X B c = AUBu{cU{x} : ceADB} 

The following two lemmas are straightforward. 

Lemma 3.7. Let A,B, be sets and let x be an object such that x ^ T for every T £ A UB. 
Then 

\A® X B\ = \A\ + \B\ 
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Lemma 3.8. Let § be a system, let x £ dim(§) and let {§',§"} be the pair of restrictions 
ofS associated with x. Then: 

str(§')©.vStr(S") Cstr(S) 
Lemma 3.9. Let § be a system. Then: 

|S|<|str(S)|. 

Proof. We prove the claim by induction on dim(S). The case of dim(S) = is trivial. 
Otherwise, pickx 6 dim(S) and let {§',§"} be the pair of restrictions of § associated 
with x. It is enough to show that §',S" satisfy the three Assumptions of Lemma [3~4l 
Assumption 1 is immediate. Assumption 2 follows from Lemmas l3.7l and [3~8l Assump- 
tion 3 follows by the induction hypothesis. □ 

To see that Lemma [3~9l generalize Sauer lemma, note that by the definition of vc 
dimension: 

str(S) C {Y C dim(§) : |T|<vc(§)}. 

Hence, Sauer lemma (Theorem l3.2b is an easy conclusion of Lemma [3~9l 

To establish the Sandwich theorem it remains to show that |sstr(§)| < \E>\ and this 
is accomplished via applying the Duality tranformation on the proofs of Lemmas 13.41 
and l3.8l and |3.9l It is easy to verify that the duals of our proofs for these lemmas are 
valiqj. The dual of Lemma |3~9l gives the desired inequality, namely, 

|sstr(S)| < |S|. 

This finishes the proof of Theorem |3.3l 

Our proof for the Sandwich Theorem demonstrates the usefulness of the Duality 
Transformation. However, it is important to write some lines about the fragile nature 
of this tranformation. This transformation works only for some lemmas/theorems. It 
is not hard to find (true) claims whose dual claims are not true. Moreover, as we will 
see in the next section (Lemma |3. lit , there are some cases in which the dual of a 
lemma is true but the dual of the presented proof is not valid. Thus, when using this 
transformation, one has to take special care and to check the validity of the arguments 
being transformed. 

3.3 SE systems and the Sandwich theorem 

The inequalities in the sandwich theorem suggest two more extremal properties of sys- 
tems, namely: 

• Systems § for which sstr(S) = |§ 

• Systems § for which \E>)\ = str(S) 

5 Note that the Dual transformation of a proof transforms only the text explicitly written. It does not 
translate implicit arguments 
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The following result concerning the equivalence of these extremal properties was 
proven independently in (2), and in 0: 

Theorem 3.10. Let § be a system. The following statements are equivalent: 

1. sstr(S) = str(S) 

2. |sstr(S)| = |S| 

3. |S| = |str(S)| 

This section discusses and proves Theorem l3.10l 

Lemma 3.11. For every system S: |str(8)| = |§ sstr(S) = str(S) 

Proof. The lemma is proven by induction on |dim(S)|. Let B g str(S). It is enough to 
show that B 6 sstr(S). The case of B = dim(S) is easy. Otherwise, pick x ^ B and let 
S',S" be the two restrictions of S associated with x. By Lemma [3~4l §',§" satisfy the 
premise of the current lemma. By the induction hypothesis: 



sstr(S') = str(S') 

sstr(S") = str(S") 

Moreover, Lemma U4l also implies that |str(S')| + | str(S")| 
with Lemmas l3.7l and [378l implies that 

str(S')e J[ str(S") = str(S) 



(2) 
(3) 

|str(S)|. This, combined 
(4) 



Therefore: 

B g str(§) 



Bestr(S')®xStr(S") 
Bgstr(S')Ustr(S") 
Besstr(S')Usstr(S") 
B g sstr(S) . 



by Equation 2] 
since x £ B 
by Equations (f2]) and ([3]l 



□ 



The dual of Lemma l3.11l is true. However, we dont know a proof for Lemma [3.1 II 
whose dual is valic0. Therefore, we present a different proof for the dual lemma. 

Lemma [Z9l implies that for all S: |str(S)| + |sstr(-.§)| = 2l dim ( s >l Clearly, \S\ + 
|_,g| =2 |dta(S)l, This implies: 

Lemma 3.12. For every system §, the following statements are equivalent: 

1. |S| = |str(S)| 

2. |sstr(-.S)| = |-S| 



6 The dual of our proof of Lemma D.lll is not valid as observed, for example, by the dual of the second 
line in the proof: "Let B £ sstr(S). It is enough to show that B G str(S)... ". This argument is obviously 
wrong. 
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Lemma 3.13 (Dual of Lemma [3.1 U . For every system §: 

|sstr(S)| = \S\ => sstr(S) = str(S). 

|sstr(§)| = |S| =S> |str(-.S)| = h§| by LemmaEE] 

=^> sstr(^S) = str(-.S) by Lemma[3Ti] 

=>■ sstr(S) = str(S) by TheoremEE] 

□ 

The Sandwich Theorem and Lemmas |3 . 11113. 1 31 finish the proof of Theorem l3.10l 
By Theorem l3.10l it follows that any of the 3 properties mentioned there, may serve 
as a definition of Shattering Extremality . 

The most common definition in the literature is |§ = |str(S)|. ( See J3][T1[9)) 
We prefer sstr(S) = str(S) as this definition is invariant under duality. 

3.4 A recursive characterization of SE 

Theorem 3.14. Let Sbe a system, let x G dim(S) and let {§',§"} be the pair of restric- 
tions ofS associated with x. Then the following statements are equivalent: 

1. SE(S). 

2. SE{S') andSE(§") and str(S') ® A -str(§") = str(S). 

3. SE{S') andSE(S") and sstr(S') ®. v sstr(§") = sstr(S). 

Proof. The proof is divided into two parts. 

First part: 1 <S=^ 2. By Lemmas l3T7l and l3.8l it follows that 

str(S')e,str(S") = str(S) |str(S')| + |str(S")| = |str(S)|. 

This, combined with Conclusion (b) of Lemma [3~4l and Theorem 13. 101 concludes the 
first part of this proof. 

Second part: 1 3. This part is the dual of the first part. It is easy to verify that 
the dual of the proof given for the first part is valid. □ 

3.5 Maximum systems 

The class of Maximum systems is the "famous relative" of the class of SE-systems. 

Definition 3.15 (Maximum systems). S is called a Maximum system if it meets 
Sauer's lemma with equality: 

vc(S) / x 
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Maximum systems and their applications in machine learning have been discussed 
in |[T6l [T4l [T5l [TT1 [6). The relation between maximum systems and arrangements of 
hyper-planes in a Euclidean space have been discussed in JS] 

SE systems are an extension of maximum systems (i.e every maximum system is 
SE). Moreover, SE systems seem to possess all "nice properties" of maximum systems 
and some other "nice properties" that maximum systems do not have. (See for example 
Theorem l4.41 i 
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4 Operations on systems 

4.1 Pre-systems and restrictions 

Let § be a system and let C C C(S) be a cube. Consider the structure (5(8) nC,C). 
According to our terminology this structure is not a system. We refer to it as a "pre- 
system". Namely: 

Definition 4.1 (pre-system). Let X be a set. A pre-system is a pair (S,C) where C is 
a cube of {0,1 } x and 5 CC' 

Clearly, every pre-system (S,C') can be translated to a system by restricting its 
vertices to dim(C') as follows: 

Definition 4.2 (Normalizing). Let X be a set and let C C {0, 1} X be a cube. Define 
n:C^ {0,1 } dim ( c) by 

«(/) =/tdim(C) 
This mapping is called the normalizing of C. 

this mapping is naturally extended to subsets of C as follows, Let 5 C C. Then 

def 

n(5) = {«(/) | / G 5}; and to pre-systems. Let § = (S,C) be a pre-system. Then 
n(S) d = (n(5), {0, l} dlm ( c )). Note that the normalization of a pre-system is a system. 

Definition 4.3 (restriction). Let §be a system, let C' C C(S) fee a CMfee. Lef S f c / denote 
the normalization of the pre-system (5(§) HC' ,C). 7%ii system is called the restriction 

o/S to C' 

4.1.1 Restrictions preserve SE 

The following important property of SE systems is an immediate corollary of Lemma 
EH 

Theorem 4.4. Let § be a system, and let C be a cube. Then: 

SE(S) =^ SE(S\ C ) 

The property of "being a maximum system" is not preserved by restrictions. There- 
fore this theorem is an example of the advantage of SE systems over maximum systems. 

4.2 Boolean operations 

Definition 4.5 (Boolean operations). Let A, B be two systems with dim( A) = dim(B) 
X . The union and intersection of A,B is defined as follows: 

1. A W B d = (5(A) U 5(B) , {0, 1 } x ) 

2. A i+i B d = (5(A) n 5(B), {0,1 } x ) 
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Note that if dim(A) ^ dim(B) then these operations are undefined and that both 
tfcl , i+l are associative and commutative binary operations. These two operations and 
the complement operator d2.lt satisfy the relations of the (corresponding) operations 
in a boolean algebra. 

As an example, the following is a trivial variant of the famous De-Morgan's laws: 

1. -i (A W B) = (-.A) i+l (-JB) 

2. -.(A i+l B) = (-.A) 1+) (-.B) 

These operations are used as unary operators in the standard way: Let T be a 
collection of systems such that VS,S' € T : dim(§) = dim(S'). l+J 1 denotes the 
union over all elements of 1 and (+) 1 denotes the intersection over all elements of 
T. 

4.3 The Boolean operators [+| r and |+J y 

Definition 4.6 (|+]y and l+Jj'). Let Y be a set. The following operators, flj^l+Jy are 
defined only on systems § such that Y C dim(S). In this case: 

1. 

(+](§) d = (+| {S\ c | C is a cube with dim(C) = dim(S) - Y} 

Y 

2. 

(+)(§) d = L+J {S\ c | C is a cube with dim(C) = dim(S) - Y} 

Y 

def 

For an operator q G {1+Jj',f+1}'}, let dim(q) = Y. 

The operators , 1+Jy are extensively used under many different notations in the 
literature. Most works noted the duality between these opersators. However, our no- 
tations highlights the fact that this duality stems from the famous duality of Boolean 
algebra. For example, several works (]2|- page 674,(11) and 0- observation 22) ob- 
served the following lemma, Lemma |4771 but didn't notice that this lemma is the famous 
De-morgan laws. 

Lemma 4.7 (De-Morgan). Let Y be a set, then: 

• fly 0-1 = ° Wy 

• Wy a- 1 = -> ° fly 

4.3.1 Analogues of theorems and lemmas 

Definition 4.8. Let S be a system and let x G dim(S). The derivatives of § associated 
with^,are the set {1±){^}(S),|+|^(S)}. 
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Several lemmas and theorems from chapter 2 contain the phrase "let {§',§"} be the 
pair of restrictions of § associated with x". Namely: Lemma [3781 and Theorem l3.14l 
The only properties of the above S',S" that were used in proving these lemmas and 
theorems are: |S'| + |§"| = \S\ and the fact that {S',S"} satisfy Lemma[l8] It is easy to 
verify that the pair of derivatives (associated with x) of a system § also have these two 
properties. Hence these theorems and lemmas hold when the above phrase is replaced 
with the phrase "let {D',D"} be the two derivatives of § associated with x". Hence, 
the Sandwich Theorem may be proved by induction via the derivatives rather than the 
restrictions. An important result that is established this way is the analogue of Theorem 

EH 

Theorem 4.9. Let Sbe a system, let x £ dim(§) and let W , D" be the pair of derivatives 
ofS associated with x. Then the following statements are equivalent: 

1. SE(S). 

2. SE(W) and SE(B") and str(B') ® x str(D") = str(S). 

3. SE(W) andSE{B") and sstr(O') ©jSstrQD") = sstr(S). 

It is quite surprising that these claims concerning W,W are established using ar- 
guments that are symmetric in D' and D". 

4.3.2 (+) y and 1+| F preserve SE 

The following theorem was proven by several authors: 

Theorem 4.10. Let 8 be a system and let Y C dim(§), then 

• (+| y (S) isSE 

• |+J y (§) isSE 

This theorem is easily follows from Theorem |4.9| and the following simple fact. Let 
Y, Y' be disjoint sets, and let a be one of the symbols " f+) ", " 1+J ". Then: 

ay o a! Y = a Y uY' 

4.4 Down-Shifting 

The down shifting operator is often used in the context of shattering. We prefer to 
present this operator in the terminology of oblivious sorting algorithms (See iflOl ). To 
this end, we present a system S by its characteristic function and assume that every 
v € C(S) has a bit b v that is either 1 or according to whether v G S(S) or v ^ S(S). 
Our "algorithms" transform the system S by changing these bits. 

Note that a 1 -dimensional cube contains two vertices. Therefore we refer to such 
cubes as edges and to an x-cube as an x-edge. 

For a set X, we let ^ denote the partial order on {0, 1}* that is the product of the 
order < 1 . 
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Definition 4.11. We say that an edge e is sorted under a system § if: 
e = {u',u"}, u < u" and by? > b u ". 

Note that the order is reversed. This is not a typo; rather, it follows the standard 
definition of down shifting. 

The most basic operation of our algorithms sorts an edge e = {u, v} by permuting, 
if needed, the contents of b u ,b v . Note that this operation mimics a "comparator" (see 
iflOl ) of a sorting network. 

Definition 4.12 (Down-shifting on x). Let § be a system and let x € dim(§). Down 
shifting § on x is the operation of sorting every x-edge. The resulting system is denoted 
dn x (S). 

Lemma 4.13. Let § be a system and let x 6 dim(§). Let {ED', ED"} be the pair of 
derivatives ofS associated with x and let {R',R"} be the pair of restrictions of dn v (§) 
associated with x. Then: 

{M',M"} = {D',D"} 
Definition 4.14. Let §>be a system. 

• We say that § is x-sorted if every x-edge is sorted under §. 

• We say that § is edge-sorted if every edge is sorted under S. 

The following lemma is easy: 

Lemma 4.15. Let § be a system and let x S dim(§) such that S is x-sorted and let 
{§', S"} be the pair of restrictions ofS associated with x. Then: 

1. str(S')©*str(S")=str(S) 

2. sstr(S') ®. v sstr(S") = sstr(S) 

It is sometimes desired to translate our systems to the "classical" format of set- 
system. To this end: 

Definition 4.16. Let S be a system. 

SET(§) ^{/-'(l) | /£<>(§)} 

Observe that a system § is edge-sorted if and only if SET(§) is closed under the 
subset relation. Using this observation is not hard to prove that: 

Lemma 4.17. Let § be an edge-sorted system. Then: 

sstr(S) = str(S) = SET(§) 

"Edge-sorting" a system by applying a sequence of down-shifts is used often in the 
literature. The theory of sorting teaches us how many shiftings are required to make 
the system edge-sorted, as indicated in the next known lemma (See IfTUl lTll): 
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Lemma 4.18. Let A £ K mxn be a 2-dimensional array with keys from a linear ordered 
set K. If the rows of A are sorted, then sorting the columns of A preserves the rows 
sorted. 

This fact is easily generalized to multi-dimensional arrays. In our context, it is 
manifested by the following lemma. 

Lemma 4.19. Let § be a system and let x,y £ dim(S), then dn x (dn y (§)) is y -sorted. 

Definition 4.20. Let Q be a sequence of operators and let x be an object. Let Q(x) 
denote the result of applying the operators in Q on x one after the other. 

Definition 4.21. The phrase "Q is a sequence ofW " means that W is a set and Q is a 

sequence of members ofW in which each member appears exactly once. 

An immediate corollary of Lemma 14.191 is the following fact which was observed 
by several authors: 

Theorem 4.22. Let § be a system and let Q be a sequence of {dn x : x € dim(§)}. 
Then Q(S) is edge-sorted. 

4.4.1 Shifting preserves SE 

The following theorem was proven by several authors: 
Theorem 4.23. Let § be a system such that SE(E>) and let x £ dim(§). Then 

dn x (§) isSE 

Proof. Let M.' and M" denote the two restricitons of dn x (§) associated with x. By 
Theorem l3.141 it is enough to prove that 

1. SE(R') andS£(K"), 

2. str(R') © x str(K") = str(dn r (§)). 

Statement 1 easily follows from Lemma 14.131 and Theorem |4.9l Statement 2 follows 
from Lemma l4.15l □ 
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5 Commutativity between Boolean operators 

Henceforth, "Boolean operators" means operators of the form \+j Y an d (+)y- This chap- 
ter studies the commutativity between Boolean operators and its relations to the SE 
property. 

Since these operators are partial - it is important to note the following: 

• Let/ : A — > B,g : A' — be two operators. The operator fog is always definedQ 

• It is convenient that statements of the form lt e\ = e?" are always meaningful, even 
when e\ orei are undefined. To this end the statement lt e\ = e-f is considered to 
be true either when both sides are defined and equal or when both are undefined. 
In any other case the statement is considered to be false. 

The following lemma summerizes the trivial cases regarding commutativity of the 
Boolean operators. 

Lemma 5.1. Let Y \Y' be sets and let each of 'a, (3 be one of the symbols " (+] ", " [+J ". 

1. IfY H Y' 7^ then OCy ° Pr' = Py' ° a Y (both of them are nowhere defined). 

2. a Y o a Y i = ay o a Y . Moreover: ifYDY' = then ay ° a F / = a YUY i 
It is left to study the case of Y n Y' = 0. 

5.1 Boolean operators commute on SE systems 

Welzl et al have shown in [8 1 the following theorem: 

Theorem 5.2. Let S be a maximum system and let x,y G dim(S) such that x ^ y. Then: 

W M {rf M 
We show the following stronger lemma: 
Lemma 5.3. Let S be a system and let x,y G dim(S) then: 

1. R w oW W (S)2W W ofl w (S) 

2. se(s) => Fi w °y w (s) = y w °Fi w (s) 

Remark: Statement 1 was established by Welzl et al in (8). 

Proof. We consider two cases. 

Case 1: dim(S) = {x,y}. There are exactly 16 systems under this case and it is easy 
to verify that the lemma holds for all of them. 

Case 2: |dim(S)| >2. Let/- % } °l±J {v} , g = |±| {y] op\ {x} andlet v G {0, 1}^-^. 
It is easy to see that there exist an {jc,y}-cube C C C(S) such that: 

/(sr c )=/(§)r H , 

7 If Range(g) n Dom(f) = then fog is the operator which is nowhere defined (i.e Dom(fog) = 0) 
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g(S\ c )=g(S)\ {v} 

Thus, the lemma follows by Case 1 and Theorem l4.4l □ 

The following lemma is an easy corollary of the first conclusion of Lemma 153] 

Lemma 5.4. Let Q be a sequence of operators from {l+l^ijlil^} | x £ dim(§)} such 
that Q(S) is defined. Let Q be a permutation of Q in which all the unions appear at the 
beginning and let Q be a permutation of Q in which all the intersections appear at the 
beginning. Then: 

2(s)ce(§)ce(§) 

Definition 5.5. Let Q be a sequence of operators and let x be an object. We say that Q 
commutes on x if for every Q', a permutation of Q: 

Q(x) = Q'(x) 

From 14. 1 Ol 15 . 1 1 and the second conclusion of Lemma 1531 we get: 

Lemma 5.6. If § is an SE system and Q is a sequence of Boolean operators, then Q 
commutes on §. 

Lemma [531 by itself does not immediately imply 151)1 Namely, having ff|^j and 
1+J{,.} commute on § for each pair x,y G dim(§) does not imply that every sequence Q 
of such operators commutes on S. (See Theorem l7.1U 

5.2 A characterization of Shattering and Strong-Shattering 

To complete the picture, what is left is to study the other direction: when does commu- 
tativity of Boolean operators on § imply SE(S). To this end, we present the following 
characterization of shattering and strong-shattering: 

Theorem 5.7. Let Sbe a system and letY CI = dim(S). Then: 

1. § shatters Y fi\ Y o\+j x _ Y (S>) = % 

2. § strongly shatters Y <J=> l+Jx-r °(+)f(^) = 

Recall that is the non-empty 0-system 
Theorem l5.7l follows from the next two straightforward lemmas. 

Lemma 5.8. For every system §, the following statements hold: 

1- Wdi m( s)(§) = ^ S(S)^® 

2- fldim(S#) = 3G S(S)=C(S) 
Lemma 5.9. For every system § and Y C dim(S): 

1. Testr(S) ^ 5(l±J dim(S) _ F (§))=C(l±J dim(s) _ y (§)) 

2. TGsstr(S) ^ S((+| y (S))^0 
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5.3 A characterization of SE 

Our characterization in Theorem l5.7l combined with Lemma [5T6l gives a new and simple 
proof of the following theorem. 

Theorem 5.10 (0). The following statements are equivalent for any system S: 

1. SE{§) 

2. VY',Y" C dim(S); f+ly/jl+Jr" commute on § 
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6 Commutativity between Shifting operators 

6.1 Shifting operators commute on SE systems 

Lemma 6.1. Let § be a SE system and let x,y £ dim(§) such that x^y. Then dn r and 
dn* commute on S. 

Proof. We consider two cases. 

Case 1: dim(§) = There are exactly 16 systems under this case and it is easy 

to verify that the lemma holds for all of them. 

Case 2: |dim(§)| > 2. let / = dn v odn v , g 1 = dn v odn x . It is enough to prove that 
for every {jc,y}-cube, C C C(§): 



From l4.23l and Lemma loTTI we get: 

Lemma 6.2. If § is an SE system, and Q is a sequence of operators from {dn T | x £ 
dim(§)} then Q commutes on §. 

6.2 A characterization of Shattering and Strong-Shattering 

In this section we present a characterization of str and sstr in terms of shiftings. Recall 
from Definition 14. 2 1 1 that he phrase "2 is a sequence of W" means that W is a set and 
Q is a sequence of members of W in which each member appears exactly once. 

Definition 6.3. For a set X, define: 

fullshift(X) d = {2 : Q is a sequence of {dshiftx ■ ieX}} 
The following theorem is the main theorem of this section. 
Theorem 6.4. For every system S: 



m\ c =g(s)\ c 



It is easy to see that for every such C: 



/(s)r c =/(sr c ), 




□ 



str(§) 



SET( |+) (6(S) 



Qefullshift(dim(S))}) 



2. 



sstr(S) = SET( f+1 (6(S) 



Q e fullshift(dim(§))}) 



The following definition will be useful: 
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Definition 6.5. Let x be an object and i G {0, 1 }. Define: 
Definition 6.6. Lef § be a system and letx G dim(S). 

Lef Cj = {v G C(S) : v(x) = /}, where i G {0, 1} denote the two sub-cubes ofC(§) 

associated with x. 

Define: 



\ x=i = f S\ Ci , ie{0,i} 



The following lemma is straightforward. 
Lemma 6.7. Let x,y be distinct elements and let i,j £ {0, 1}. Then: 

1. The two operators \ x=i and L_; commute. 

2. The two operators \ x=i and dn y commute 

3. \ x=i odD. x = $(x,i) 

Definition 6.8. Let X be a set, let Q G fullshift(X) and let v G {0, 1} X . Define g[ v ] to 
be the sequence of Boolean operators that is derived from Q by replacing every dn v by 
P(jc,v(jc)). 

Lemma 6.9. Let Sbe a system, let v G C(S) an<f /ef Q G fullshift(dim(S)). Then: 

<2(s)r { v } =<2[v](s) 

Proof. Let X = dim(S) and let Z be a sequence of { f f=v ( v ) : x G X}. It is easy to see 
that for every X-system G: 

Z(G)=Gt {v} . 
Thus, Z(<2(S)) = g(S) t{ v }- Also, by LemmaO 

Z°!2 = G[v]- 

Therefore, Z(<2(§)) = (2[ v ] (S). This implies the desired equality. □ 

The following two lemmas finish the proof of Theorem l6.4l 
Lemma 6.10. For every system §: 

str(S)=SET( [+J {£>(§) : Q G fullshift(dim(§))}) 
Proof. Let v G C(S). It is sufiicient to prove that: 



v _1 (l) G str(S) |+| {G(S) : Q G fullshift(dim(S))} t {v} 



Indeed: 
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l+J : eefuUshift(dim(S))}r {v} = 

by the definition of l+J 

l+J {Q{®)\{ v} : Q G fuUshift(dim(§))} = 

by Lemma l6^9l 

l+J {G[ V ](S) : Qefullshift(dim(§))} = 

by Lemma [5~4l 

(+1 ( J+J (§))• 

v-l(l) v-'(O) 

By Theorem l5.7l the lemma follows. □ 

Using similar arguments we also prove that: 
Lemma 6.11. For every system S: 

str(S) = SET( (+| {(2(S) : Q e fullshift(dim(S))}) 

6.3 A characterization of SE 

Theorem 14 . 2 3 1 and Theorem |6.4| imply the following theorem: 

Theorem 6.12 (|[3]|2l). The following two properties of a system § are equivalent: 

1. SisSE 

2. For every Q',Q" G fullshift(S): 

G'(S) = 
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7 A weaker form of Shattering Extremality 

This chapter introduces a weaker form of Shattering Extremality which is denoted by 
k-Shattering Extrmality (or k-SE) where £ is a natural number. We study this property 
via the concept of local-operators which is interesting in its own right. The first section 
introduces the concept of local operators and discusses some basic properties of these 
operators. The second section defines k-SE and proves generalizations of Theorems 
16. 121 and 15. 101 regarding equivalence of k-SE with weaker forms of commutativity of 
Boolean operators and of down-shiftings operators. 

7.1 Local operators 

Let SYS denote the set of all systems. For a system § and Y C dim(S), the collection 
of 7-cubes of C(S) forms a partition of C(§). This partition induces a partition of § in 
which every component corresponds to a restriction of § to a 7-cube. Informally, a Y- 
local operator is a mapping from SYS to SYS whose behaviour on § depends only on its 
behaviour on the restrictions of § to Y-cubes. Surprisingly, all the operators discussed 
in the previous chapters (l+Jy , f+|y , dn T , \ x=i and -i) are "highly-local" (this will become 
formal later). 

Let X be a set. It is convenient here to extend the definition of "C is a 7-cube of 
{0,1 } x " to the case where Y <£. X. So, a new Y cube of {0, \} x when Y % X is an 
X n 7-cube of {0, \} x under the original terminology. It is also useful to extend the 
definition of the operators f v=0 and \ x=l in the following way: For a system § such 
that* £ dim(S), define §[ x=0 = S\ x=l = (0, {0, l} dim(S) ). This way, these operators are 
total operators from SYS to SYS. 

Definition 7.1. Let Y be a set. An operator a : SYS SYS is called Y -local if the 
following statements hold: 

1. a is total. 

2. There exists a set Z CY such that for every system S: dim(a(S)) = dim(S) — Z. 

3. a commutes with the operators \ x= q and \ x=i for all x £ Y . 

The reader might wonder how does this definition captures the concept of local- 
ity. To answer this question, assume, for simplicity, that Y C dim(S) and that Z = 
(namely, for all S: dim(oc(§)) = dim(S)). Note that every restriction of S to a 7-cube 
can be described by an appropriate sequence of operators of { \ x=i : x £ dim(S) — Y}. 
Thus, item 3 in the definition of F-local amounts to: 

For every C, a 7 -cube of C(§): a(S)f c = a(§[ c ). 

Therefore, each component in the partitioning of 0c(§) to the 7-cubes of C(§) depends 
only on the corresponding component of S. Thus, the local behaviour of a on Y- 
systems determines its behaviour on every system. The fact that every F-local operator 
is determined by its behaviour on ^-systems is expressed in the following straightfor- 
ward lemma. 
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Lemma 7.2. Let a and P be two Y -local operators and let § be a system. Then the 
following statements are equivalent: 

1. a(S)=P(S) 

2. For every Y -cube o/C(S), C: a{§\ c ) = p(S|" c ) 

Example 7.3. The operator -> and the Identity operator are %-local with Z = 0. 

Example 7.4. Let x be an object. The operators \ x= q and \ x= \ are {x]-local with 
Z={x}. 

Example 7.5. The operator dn x is not total since it is not defined on systems § with 
x ^ dim(§). Thus, henceforth this operator is extended to such systems by dn r (§) = 
(0, {0, l} dim(s) ). Under this modification, dn A is {x}-local with Z = 0. 

Example 7.6. The operator 0C{ V } where a is one of the symbols ' l+J ' or ' (+) ', is not 
defined on systems S/or which x £ dim(§). Thus, henceforth this operator is extended 
to such systems by OCr^S) = (0, {0, l} dlm ( § )). Under this modification, ar^j is {x\- 
local with Z = {x}. 

The following lemma is straightforward. 

Lemma 7.7. Let X and Y be sets and let a be an X-local operator and P be a Y -local 
operator. Then OLofi is X U Y -local. 

Using Lemma 17771 we can compute the locality of sequences of local operators. 

Let Q be a sequence of Boolean operators. We say that Q is meaningful if every 
two distinct operators, q',q", in Q satisfy^ dim(g') ndim(</') = 0. Note that if Q is a 
sequence of Boolean opearators which is not meaningful then the composition of the 
elements of Q is the operator which is nowhere defined. Thus, all the theorems and 
lemmas we have proved regarding sequences of Boolean operators remain true if we 
consider only meaningful sequences. For a sequence of Boolean operator Q, define 

dim(g)= / U dimfe) 
q is in q 

Thus, by Lemma 17771 we obtain: 

Example 7.8. Let Q be a meaningful sequence of Boolean operators. Then Q is 
dim(Q)-local. 

Similarly, we have for down-shifting operators: 

Example 7.9. LetX be a set and let Q 6 fullshift(X). Then Q is X-local. 

7.2 k-SE 

Definition 7.10. Let § be a system and let k € N. S is called k — SE if: 
For every cube o/C(S), C with |dim(C)| < k: SE(S\ C ) 
We will refer to the statement "S is k ~ SE " by "SE^(S) " 
8 Recall from Definition f46]that dim((+l r ) = dim(|+J y ) = Y. 
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7.2.1 k-SE and Boolean operators 

The following theorem is the generalization of Theorem l5. lOl concerning k-SE. 

Theorem 7.11. Let § be a system and let k £N. The following statements are equiva- 
lent: 

1. Every meaningful sequence of Boolean operators Q with |dim(<2)| < k commute 
on §. 

2. For all disjoint Y',Y" such that \Y' UY"\ < k: (+] F / and \+} Y " commute on §. 

3. SE k (S). 

Proof. 1 <^=> 2 is a simple corollary of Lemma [5T4l 

2 <^=^ 3: By Lemma [7721 and Example ITU Item 2 is equivalent to the following 
statement. 

For every C, a X-cube of C(§) where \X\ <k: (+] F and \*}x-Y commute on S \ c 
By Theorem |5.10l the above is equivalent to &E&(S). □ 

7.2.2 k-SE and down-shifting operators 

The following theorem generalizes Theorem l6.4l 

Theorem 7.12 (Refined commution of down-shifts). Let S be a system and let k G N. 
The following statements are equivalent: 

1. For all Y C dim(§) such that \Y\<k and for every Q', Q" € fullshift(F): 

e'(s) = e"(§) 

2. SE k (§) 

Proof. The proof easily follows Lemma l7.2l Theorem l6.4l and Example |7.9l □ 
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8 Arrangements of hyperplanes in a Euclidean space 

Let H be an arrangement of hyperplanes in a Euclidean space, V = W 1 . For each 
hyperplane e £ H let one half-plane determined by e be its positive side and the other 
its negative side. The hyperplanes of H cut V into open regions (cells). Corresponding 
to each cell c there is f c £ {+, — } H such that 



In this chapter we analyze the system §# = ({fc '■ c is a cell of H}, {+, — } H ). 
Section [8T1 introduces preliminaries. Sections [8 .21 and [iOl prove the following charac- 
terizations of str(S#) and sstr(S#): 

(i) X C H is shattered by E>h if and only if the normal vectors of the hyperplanes in 
X are linearly independent 

(ii) X is strongly-shattered by Sh if and only if in addition to [(I)] there is no hyper- 
plane heE -X such that C]X C h. 

Note that if H is an arrangement of lines in the plane and {h,h} QH then the first 
property amounts to "l\ and h are not parallel" and the second property amounts to 
"The intersection point of l\ , li is simpleO in H". 

Section [8T4l demonstrates a usage of the above characterizations of str(Sff), sstr(§/ ? ) 
for counting the number of cells of some arrangements. Section [831 discusses a certain 
generalization of such systems and presents some open questions. 

Figure Q] gives an example of an arrangement of 4 lines which partition the plane 
into 10 regions. The positive side of each line in the arrangement is shaded. 



'there is no other line in H that is incident to this point 




+ if c is in the positive side of e , 
— if c is in the negative side of e . 
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Figure 1 : An arrangement of (oriented) lines in the plane. 



8.1 Notations and preliminaries 

Let V = M. d be a Euclidean space. A linear hyperplane, is a (d — l)-dimensional sub- 
space of V, i.e., 

{vGV : a-v = 0} 

where a is a fixed nonzero vector in V and 6c • v is the usual inner product. An affine 
hyperplane, or simply a hyperplane, is a translate of a linear hyperplane, i.e. 



{veV 



r} 



where n is a fixed nonzero vector in V and r is a fixed scalar, n is called a normal of 
the hyperplane. Note that the normal is unique up to scaling. 

An (open) half-space is a set {v G V : a • v > c} for some a G V, c e R. If e is a 
hyperplane in V, then the complement V — e has two components each of which is an 
open half-space. It is convenient to associate every (oriented) hyperplane e CV, with 
a vector n e G V and a number r ( el such that: 

1 . e = {x G V : n e -x = r e } 



2. The positive side of e is the set {x G V : n e -x>r e }. 
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For a vector v G V, let sign(v) G {+,0, — } d be such that, for all i < d: 

ifviX), 

sign(v); \ ifv,- = 0, 
if v,<0. 

For A C V, let c/j(A) denote the convex hull of A. 

Let H be an arrangement of hyperplanes in V. We use f]H and to denote the 
intersection and union of the hyperplanes in H. 

Definition 8.1. H is called independent (dependent) if the set {n e : e G H} is an 

independent (dependent) set of vectors. 

Lemma 8.2. IfH is independent then f]H 7^ 
Proof. Let x G V . By the definition of n e : 

x G f^\H Ve G H : n e ■ x = r e . 

Thus it is enough to prove that there exists a solution, x, for the linear system: 

n e -x — r e , Ve G H. 

Independence of the n e 's implies that there exists such an x. □ 
Definition 8.3. Let Y C H. Y is called //-regular ;/ 

Y = {eeH : f|yc«}. 

Lemma 8.4. Let Y C H such that f] Y =/= and let h be a hyperplane such that f] Y C h. 
Then 

Hh G span{n e : e G Y}. 

Proof. Let U = span{n e : e G Y}. It is easy to verify that nj, G (U ± ) ± = U. □ 
Lemma 8.5. Let Y C E be independent and H-regular. Then: 

f)Y-{J(H-Y)^<t>. 



Proof. By Lemma [8~4l there exists no e G H — Y such that f]Y C e. Thus, every e G 
H — Y intersects f] Y in a proper affine sub-space, A e , of f) 7. Thus, {A e : e G H — Y} 
is a finite collection of proper affine sub-spaces of f] Y and thus, by Baire Theorem, 
cannot cover it. □ 
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8.2 Geometric interpretation of shattering 

Let H be an arrangement of hyperplanes in WL d and let Sh be the corresponding system. 
This section proves and discusses the following theorem: 

Theorem 8.6. LetX C H. Then the following statements are equivalent 

1. §h shatters X. 

2. X is independent. 
Lemma 8.7. LetX C H. Then 

X is independent => Sh shatters X. 

Proof. It is enough to show that for every f £ {+, — } x there exists some cell c such 
that f c and / agree on X. Let / £ {+, — } x . Define 

dg \r e + l iff(e)= + 
ye \r e -l if/W = - 
Since X is independent the linear system 

n e ■ x = y e , e EX 

has a solution in M. d . Let c denote a cell whose closure, c, contains such a solution. 
Then f c and / agrees on X as required. □ 

Proving the other requires more work. We split it to two parts; first, it is shown that 
if X C H is shattered by Sh then f]X ^ 0. In the second part we establish that if Sh 
shatters X then X is independent. 

For the first part, the following lemma is useful. 

Lemma 8.8. Let k £ N and let A C R* be a set that satisfies: 
(V/£{+,-}*)(3v£A): sign(v)=/. 

ThenO ech(A). 

Lemma [878l is easily proved by induction on k. 
Lemma 8.9. Let X C H be a non-empty set. Then 

Z£str(§ H ) =>• P|X^0. 

Proof. Let A £ R^ Xi/ be a matrix with the n* e 's in X as its rows and let r £ be a 
vector with the r c 's as its entries. It is enough to show that there exists x such that 
Ax = r. 

Let T denote the map x M> Ax — r. T is an affine map. Since Sh shatters X, it 
follows that: 

(V/ £ {+,-} x )(3x £ R d ) : sign(Ax-r) =/ 
Let A C Mr be a set that for every / £ {+, — } x contains Xf £ M. d for which sign (Ax — 
r) = f. Therefore, according to Lemma [8~8l it follows that £ ch{T(A)). Since T 
is affine, we have that ch(T(A)) = T(ch(A)). Thus, there exists x £ ch(A) such that 
T(x) = 0, which means that: Ax = r as required. □ 
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Lemma 8.10. Let X C H. Then 

Sh shatters X ==> X is independent. 



Proof. By Lemma [£T9| we may assume that for all e 6 X: r e = 0. (Otherwise, trans- 
late M 1 ' such that the origin is in C\X) Let {a e : e £ X} be coefficients such that 
Y,eex V-en e = 0. It is enough to show that Ve G X : a e = 0. 
Let / £ {+, -} x such that 

,/ f+ ifa, >0 
I — otherwise 

Since shatters X there exists x G K f/ such that 

sign(x-«;) =/(e) 
Furhtermore, we may choose x such that Ve < X : \x-n e \ > I. 

= x ■ ^ a e«e since a e"e = 

> ^ a e — ^ a e by definition of x 

a e >0 a e <0 



Therefore: 



£ a e - £ a e = 0. 

a e >0 a E <0 



This is only possible if Ve G X : a e = 0. □ 

8.3 Geometric interpretation of strong-shattering 

Let H be an arrangement of hyperplanes in W 1 and let be the corresponding system. 
This section proves and discusses the following theorem: 

Theorem 8.11. Let X C H. Then the following statements are equivalent 

1. §h strongly- shatters X. 

2. X is independent and H- regular. 
Lemma 8.12. LetX C H. Then 

X is independent and H- regular => strongly-shatters X. 



8 ARRANGEMENTS OF HYPERPLANES IN A EUCLIDEAN SPACE 



31 



Proof. According to Lemma [831 there exists p G f]X — U (H — X). Therefore: 

WeeH-X: \n e -p-r e \ > 0. 
Define g G {+,-} H ~ X to be: 

g(e) d = siga(n e -p-r e ) 

It is enough to show that for all / G {+, — } x there exists a cell c such that f c = g*f. 
Let/G {+,-}*. Define 




Consider the linear system 

n e -x = y e for all e G X 

Since X is independent, this system has a solution <f . 
Pick e > sufficiently small such that 

Ve G E — X : sign(n e • p — r e ) — sign(« e ■ (p + ed) — r e ). 

Let c denote the cell that contains p + zd. By the definitions of e and d it follows that 
fc=8*f as required. □ 

For the other direction: 

Lemma 8.13. Let X C H. Then 

Sfl strongly- shatters X =>■ §// w independent and H- regular. 

Proof. We will prove the contra-position of this claim. If X is dependent then §# does 
not shatter X. In particular, this means that Sh doesnt strongly-shatter X. Otherwise, 
assume X is independnet and H-irregular. Therefore, by Lemma [8^91 0^ ^ 0. More- 
over, we may assume that G C\X (by translating the space if needed). 

Since X independent and //-irregular, it follows that there exists h G H —X such 
that Q h. Thus, by Lemma [8^41 it/, G span{/v : e G X}. which means that there 
exist coefficients {a e G K : e G X} such that 

£ a,,« e = «/,. 

Note that G f]X C /; implies that r/, = 0. Assume, by way of contradiction that Sh 
strongly-shatters X. Let 

g(e) d — i + if(Xe - 
1 — otherwise 

and 

-g{e) d -l + if§ ^ = ~ 
1 — otherwise. 

Since X is strongly shattered by Sh, there exist two open cells c',c" such that: 



8 ARRANGEMENTS OF HYPERPLANES IN A EUCLIDEAN SPACE 



32 



(i) f(j agrees with g on X 

(ii) f c n agrees with — g on X 

(iii) f c i agrees with f" on H — X 

We will derive a contradiction to item |(iii)| by showing that f c J (/?) ^ f c ii (h). 
Let x 1 G c' and x" e c". 

fc'{h) = sign («/,•*') 

= sign( [ £ a e «;] • x') £ a «"e = «/, 

= sign(^ a e [« e -jc']) 
eex 

= + by definition of f c i : a e [n e -x'] > 

Similarly we have / c "(/j) = — and we are done. □ 

8.4 Counting cells 

The following lemma is a corollary of Theorems 18. 61 [8. 1 H and Lemma [84l 

Lemma 8.14. Let H be an arrangement of hyperplanes in M. d . Then, the following 
statements are equivalent. 

1. SE($ H ) 

2. yx C H: X is independent <*=^ C]X / 0. 

Let H be an arrangement of hyperplanes such that SE(E>h). By Theorem 13. 101 this 
means that |S| = |str(S)|. Thus, by Lemma [8.14| and Theorem |8.6| we get the following 
result: 

Lemma 8.15. Let H be an arrangement of hyperplanes in R d such that 

MX C H: X is independent ^ - 

Let C denote the number of cells in the arrangement. Then: 

C = \ {X C H : X is independent} \ . 

in particular, if the arrangement H is in a general position (i.e. every d hyperplanes 
intersect in a single point and no d + 1 hyperplanes intersect), then it is easy to verify 
that H satisfies the premise of Lemma l8.15l and that 

{XCH : X is independent} = {XCH : \X\ < d}. 

This proves the well known result regarding the number of cells, C, in such an arrange- 
ment. Namely C = £f =0 ('f ). 
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8.5 Convex systems and some questions 

Let S be a system. S is called Euclidean if there exist a Euclidean space, W 1 , and an 
arrangement of hyperplanes H such that § = S#. 

Let H be an arrangement of hyperplanes in Mr and let be the corresponding 
system. Let e 6 H and let be the two restrictions of associated with x. 

It is easy to see that E> H W corresponds to the arrangement H' = H — {e} and that 
S' H l±l corresponds to the arrangement H 1 (of one dimension smaller) obtained as the 
intersection of e with the remaining hyperplanes. Thus, Euclidean systems are closed 
under Boolean operators. It is not hard to prove that Euclidean systems are not closed 
under restrictions. However, the following generalization enables the representation of 
E' H , §^ in a geometric setup. 

Let K be an open convex set in M. d . Let 

&H,K=({fc ■ c is a cell of//, cC\K ^0)}, {+,-}"). 

It is easy to see that choosing K to be one of the half-spaces determined by e yields a 
representation of S' H and §^ as §>h.k- 

This setup generalizes the setups investigated by Lawerence in lfl2l and by Welzl 
et al in JS) . The systems studied in IT2l are obtained by choosing H = {{x : xi = 
0} : 1 < i < d\ and K to be arbitrary and the systems studied in (8) are obtained by 
choosing H to be any general -position arrangement and K = M. d . 

The following two theorems can be proved similarly to the way in which we proved 
theorems EH] ES 

Theorem 8.16. Let X C H. Then the following statements are equivalent 

1. S>h,k shatters X. 

2. X is independent. 

3. f]X and K share a common point. 

Theorem 8.17. Let X C H. Then the following statements are equivalent 

1. Sh.k strongly-shatters X. 

2. X is independent, H-regular. 

3. f]X and K share a common point. 

The above characterizations of str, sstr in this geometric setup yields a big class 
of SE systems that rise from arrangements of hyperplanes in a Euclidean space. It 
is natural to ask whether every SE system can be implemented in such a way. The 
following is a partial answer to this question. 

Theorem 8.18. There exists an SE system § such that there exists no arrangement of 
hyperplanes, H, such that § = S#. 



Proof. By Theorem l8.6l for every H, it follows that str(S#) is a matroid. It is easy to 
construct an S/i-family § such that str(s) is not a matroid. □ 
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However, the answer to the following question is not known. 

Question 8.19. Let § be an SE system. Is there an arrangement H and a convex set K 
such that § = Sh,k ? 
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9 Orientations of an undirected graph 

This chapter presents a method which uses the tools we developed in order to prove 
certain equalities and inequalities involving undirected graphs and their orientations. 
Unlike most applications of these concepts, which use the "shattering" relation, this 
one demonstrates a usage of "strong-shattering". The method is demonstrated via two 
examples. 

Henceforth, let G — (V,E) be an arbitrary fixed undirected simple graph. We rep- 
resent an orientation of £ as a function £/:£—!>{+,—}. In order to enable this, pick 
a fixed orientation 1? of E. Thus, a function d :£—>{+, — } define an orientation of 
E, relative to E*, in the obvious way: If d(e) = + then e is oriented the same way as in 
~Tl . Else, if d(e) = — then e is oriented in the opposite way as in if. For an orientation 
d, let Gd denote the directed graph obtained by orienting E according to d. 

clef 

For X C E, let Gx = (V,X) denote the undirected subgraph of G with X as its 
edges. 



9.1 Cyclic orientations 

The first example proves the following inequality: 

Theorem 9.1. The number of orientations of G that yield a directed cycle is at least 
the number of its subgraphs that have an undirected cycle. 

Consider the system § d = {{d G {+, — } E : Gd has a directed cycle}, {+, — } E ). 
The main step in this method is to characterize sstr(S) or str(S) (or both). In this 
example, the following lemma characterize sstr(S). 

Lemma 9.2. Let X C E. Then X E sstr(S) Ge-x has a cycle 

Proof. If there exists a cycle, C, in E — X then there exists an orientation of E —X, d, 
such that C is oriented to a directed cycle. Clearly, every extension of d to an orientation 
of E contains this directed cycle. This means thatX e sstr(S) 

For the other direction, assume that E—X doesn't contain a cycle. To establish that 
X sstr(S), it is enough to show that: 

(yd' :(E-X)-> {+, -})(3d" : X -> {+,-}) : G d i* d » has no directed cycle. 

Let d' : (E—X) {+,—}. Since E—X has no cycles, it follows that the orientation 
of Ge-x according to d! is a DAG whose edges form a pre-order, P, on V. Pick (by 
topological sorting) a linear order L of V that extends P. Let d" : X — > {+, — } be the 
orientation of the remaining edges accordin£0 to L. Clearly, the resulting orientation, 
d'-kd", is a-cyclic. □ 

Theorem l9.ll easily follows from the Sandwich Theorem (Theorem l3.3l ). It is easy 
to see that in some graphs the inequality is strict, as observed by taking G to be a simple 



every edge is oriented towards the bigger vertex 
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cycle. Indeed - for such G there are two possible orientations that yield a directed cycle 
but only one X C E such that Gx has a cycle. 

Consider the system -i§, namely the orientations of G that yield an a-cyclic graph. 
By Lemma |Z91 it follows that 

Lemma 9.3. LetX C E. ThenX E str(-iS) Gx is a forest . 

This, combined with the Sandwich Theorem, gives the following inequality: 

Theorem 9.4. The number of orientations of G that yield an a-cyclic graph is at most 
the number of its subgraphs that are forests. 

These characterizations of str(-i§) and sstr(§) also give a natural interpretation of 
the VC-dimension and the Dual VC-dimension. The dual VC-dimension of a system is 
obtained by applying the Duality Tranformation on the text defining the VC-dimension: 

Definition 9.5 (Dual- VC-dimension). The Dual VC-dimension (Vapnik Chervonenkis 
dimension) of a system S, denoted dvc(§), is the cardinality of the larges^^ subset that 
is strongly-shattered by it. 

The next lemmas easily follows from the characterization of sstr(S) and str(-iS). 

Lemma 9.6. Let |V| = n and let \E \ =m. Then 

1. vc(-i§) = k where k is the size of a maximum subforest of G. 

2. dvc(§) = m — c, where c is the size of a smallest cycle in G. 

9.2 Path preserving orientations 

The second example will demonstrate a proof for the following equality: 

Theorem 9.7. Let ffCV and let seff. Then, the number of orientations of G in 
which all vertices in W are reachable from s equals to the number of subgraphs ofG in 
which all vertices ofW lie in the same connected component. 

Let § denote the system ({d g {+, — } E : Vw eW :Gd has ans-w directed path}, {+, 
The following lemma characterizes str(S) and sstr(S). Moreover, it shows that SE(S). 

Lemma 9.8. Let X C E. Then, the following three statements are equivalent: 

1. § strongly shatters X 

2. § shatters X 

3. All vertices ofW lie in the same connected componenet ofGE-x 
11 As a special case, dvc(§) = -1 when S(S) = 
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Proof. 1 ==> 2 is trivial. 

Consider 2 3. To establish this implication, we prove its contra-position, 

namely we assume that not all vertices of W lie in the same connected component of 
E — X and we show that there exist d £ {+, — } x such that no orientation in § agrees 
with d on X. Let w £ W such that w and s lie in different components of E — X and let 
S denote the connected componenet of S . Let d denote an orientation of X in which 
w is not reachable from s (to obtain such an orientation, orient all edges connecting S 
and V — S towards S). Clearly, every orientation of E, that agrees with d on X has no 
directed path from S to V — S and in particular no directed path from s £ S to w £ V — S. 
Thus, every such orientation is not in S. 

Consider 3 ==> 1 . Assume that all vertices of W lie in the same connected com- 
ponenet of Ge-x- Let d be an orientation of E —X such that s is a root of its connected 
component. Note that in particular this means that every w £ W is reachable from s. 
Clearly, in every extension of d to an orientation of E, every w £ W is reachable from 
s. □ 

Similarly to the first example, the Sandwich Theorem establishes Theorem |9.7| 
The following lemma is an easy corollary of Theorem 19. 8 1 

Lemma 9.9. vc(S) = m — t where t is the size of a Steiner tree for W. 

Consider the case in which W = {s,t}. In this case, vc(S) = m — d where d is the 
size of a shortest s — t path in G. Moreover, ->§ is the set of all orientations that do not 
contain an s — t directed path, and by Lemma |2~9l we obtain: 

Lemma 9.10. The number of orientations that do not contain a directed path from s 
to t equals to the number of subsets of E that separates s and t. Moreover, vc(-iS) is 
m — c, where c is the size of a minimum s — t cut. 

9.2.1 An interesting class of SE systems 

In the preceding chapters we showed that SE systems are preserved by many operations 
(e.g complementing, restrictions, Boolean operators,...). A natural question is to ask 
whether the class of SE systems is closed under union and intersection. It is not hard 
to design to SE systems, §' and S" that demonstrates that it is not the case. However, 
there are some subclasses of SE systems that are closed under these operations. One 
trivial example is the class of all edge-sorted systems. In this subsection we introduce 
another such class. 

Pick some vertex s £ V and let v £ V — {x}. Let S s v be the system of all orientations 
which yield a directed path from s to v. Consider the class 

{§,,„ : v£V-{s}} 

By Lemma |9~8l it follows that every system in this class is SE. Moreover, by the same 
lemma we get that any intersection of systems in this class is SE. It is not hard to prove, 
using the same method, that every union of elements in this class is SE. Therefore, this 
forms a non-trivial example for a class of SE systems that is closed under unions and 
intersections. 
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